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We discuss data compression for CMB experiments. Although "radical compres- 
sion" to Ci bands, via quadratic estimators or local bandpowers, potentially offers 
/»f\ 1 a great savings in computation time, they do a considerably worse job at recovering 

the full likelihood than the the signal-to-noise eigenmode method of compression. 



^ ■ We model a CMB observation at a pixel p = 1 . . . Np, as Ap = Sp + Up, 

0^ ! where s and n represent the contribution of the CMB signal and the noise, 

^— ^ ■ respectively, to the observation. The signal is given by Sp — J-pemO'£m', o.(.m is 

the spherical harmonic decomposition of the temperatm'e and T encodes the 
f-^ ' beam and any chopping strategy of the experiment. 

f~^ , We also assume that both the signal and noise contributions are described 

0^ ' by independent, zero-mean, gaussian probability distributions, with correlation 

r~| . matrices given by {spSp>) = Cxpp' and {upUpi) = Cnpp', so (ApAp/) = Cxpp' + 

Qh' Cnpp' ; here, Ct = Ct{0) is calculated as a function of 9, the parameters of the 

theory being tested in the likelihood function, and the noise matrix can include 
^ ■ the effect of constraints due to, e.g., average or gradient removal. For Gaussian 

c/3 , theories of adiabatic fluctuations, 6 is typically the cosmological parameters; 

Cd ■ alternately they could be some set of phenomological parameters such as the 

value of the temperature power spectrum Cg in some bands. 
With this notation, the likelihood function is 



C^ ' exp 



-\Ap(CT(e) + Cn)-'\' 



Calculating this requires extensive manipulations on the total correlation ma- 
trix Ct{0) + Cn over extensive portions of the parameter space. Herein lies 
the problem: in order to calculate the determinant factor in the denominator 
requires time of 0{Np). 

We can state the problem as follows: find some functions of the data, 
/» ({Ap}) such that the new likelihood, Cf{e) = P{f\e) ~ Ca[0) = P(A|6l) and 
Cf is "easier to calculate" in some appropriate sense. The definition of "~" in 
this expression is crucial. If we take it to mean "having the same variance," the 
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requirement reduces to the definition of "lossless" involving the Fisher matrixEl 
Note that this requirement will only be adequate very near the maximum of 
the distribution, i.e., where the Gaussian approximation is appropriate. For 
large data-sets, with high signal-to-noise, this will presumably be an adequate 
description; elsewhere (including those parts in the "tails" of an otherwise high- 
S/N experiment's window function that may be most interesting), it will not 
necessarily obtain that the derived confidence limits and parameter estimates 
will be the same. 

If we can find a basis in which the matrices Ct and C„ are diagonal, the 
likelihood computation simplifies from matrix manipulations to 0{N) sums 
and products. First, we whiten the noise matrix using the transformation 
provided by its "Hermitian square root," and apply the same transformation 
to Ct, diagonalizing this in turn with the appropriate matrix of eigenvectors, 
Ct — > diag(£fc), in units of {S/N)^. We then transform the data into the 
same basis, A ^- ^, in units of (S/N). In this basis, the noise and signal have 
diagonal correlations and (^^) = I + £k- Now, the likelihoocL function is a 
simple product of one-dimensional uncorrelated gaussians in ^fcH 

Modes with low £k are linear combinations of the data which probe the 
theory poorly. Thus, they are ideal candidates for removal in a data compres- 
sion scheme. Removal of these low-S/N modes is the optimal way to compress 
the data: for a given number of modes, it removes the most noise and least 
signal. Of course, the modes will change as the shape of the theory used for 
the covariance matrix Ct changes. In that case, we choose some fiducial the- 
ory (here, standard, untilted CDM) that represents the data moderately well, 
calculate its S/N-eigenmodes, and choose some number of modes to retain af- 
ter compression. For other theories, these modes will not be optimal — we will 
have removed more signal and less noise. Nonetheless, this method does quite 
well even far from the fiducial theory, as we see in the Figure, which shows 
the likelihood for a parameterization of a standard CDM power spectrum with 
amplitude, erg, and a scalar tilt, ns (so the primordial P{k) oc fc"=). Unfortu- 
nately, finding this basis is 0{N^); even performing this operation once for a 
megapixel dataset will be prohibitively expensive. 

Most experiments report their results in the form [Ci ± 6Ci] ; this encour- 
ages the use of a simple curve-fitting approach to parameter estimatiom: form 

r n 2 

the obvious quantity x^(^) = J2e ^t ~ ^ei^) I i^^i) where Ce{0) is the pre- 
dicted power spectrum for the theoretical parameters 9. In practice, the power 
spectrum is usually reported as a flat bandpower over some £ band with an 
appropriate window function, but the procedure remains the same. 

Now, just do the usual fast ^^-minimization for the parameters. This is 
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Figure 1: Contours of constant likelihood ratio (at 52 In £ = 1, 4, 9, . . .) for the SK94-5 data, 
parameterized by eg and ris, the primordial spectral slope, for different methods of data 
compression. The light-colored contours in each panel show the uncompressed likelihood. 



a very "radical" approximation to the full likelihood; it assigns an uncorre- 
lated gaussian distribution to the power spectrum: C{9) oc exp(— x^/2). In 
the figure, we show confidence intervals for the SK95 experiment with the full 
likelihood and compare them to those obtained with 1) the S/N compression 
discussed above; 2) x^ using fiat bandpowers as reported by Netterfield et alp; 
3) x^ using a quadrjatic estimator of Ci in bands of I, modified somewhat to 
minimize covarianccfl. The peaks are nearby, and the contours are similar in 
the "amplitude" direction (as), but less good in the "shape" {us) direction. 
Far from the peak, all of the y^ methods do quite poorly. The location of the 
peak is determined by the gross shape of the power spectrum, so the details 
of the calculation do not matter; this is especially true when combining the 
results of experiments probing very different scales, such as COBE/DMR and 
SK95. Away from the peak, the shape of the spectrum within the experimen- 
tal windows, the covariances of the errors, and the non-Gaussian shape all 
contribute to the determination of which theories are more highly disfavored. 
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